Publication and use of large data sets

نویسنده

  • John Rumble
چکیده

Scientific information comes in many sizes, types and levels of quality. Because of the diversity of the scientific information being published, different issues will arise in publishing different types of information electronically. In this paper, we will address issues related to electronic publication of large scientific data sets, a subset of scientific information often overlooked in discussions on electronic scientific publications. First, we establish the parameters that define large scientific data sets. Then we identify examples from a variety of scientific disciplines. Large data sets (LDS) require special technology for their creation and management, and that technology is briefly described, as well as traditional publication and use of LDS. We discuss electronic publication of large scientific data sets and their uses as they exist today. Finally, we look into the future of electronic publication of LDS, including issues such as intellectual property rights (IPR), and LDS as a source of new discovery and economics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Benford’s Law in Analyzing Geotechnical Data

Benford’s law predicts the frequency of the first digit of numbers met in a wide range of naturally occurring phenomena. In data sets, following Benford’s law, numbers are started with a small leading digit more often than those with a large leading digit. This law can be used as a tool for detecting fraud and abnormally in the number sets and any fabricated number sets. This can be used as an ...

متن کامل

Spatial Design for Knot Selection in Knot-Based Low-Rank Models

‎Analysis of large geostatistical data sets‎, ‎usually‎, ‎entail the expensive matrix computations‎. ‎This problem creates challenges in implementing statistical inferences of traditional Bayesian models‎. ‎In addition,researchers often face with multiple spatial data sets with complex spatial dependence structures that their analysis is difficult‎. ‎This is a problem for MCMC sampling algorith...

متن کامل

کاربرد روش‌های شناسایی تورش انتشار برای فراتحلیل در ارزیابی تاثیر داروی آلبندازول در درمان مبتلایان به آسکاریس و تریکوسفال

 Background : Meta analysis is a statistical method to combine the findings of a set of large number of published individual studies and re-analyse them. The use of meta-analysis methods in medical research has been increased, noticeably, in resent years. However, one of the major shortcomings in such analysis is that the researcher, could not access all conducted studies in the area of concern...

متن کامل

Misconduct in Research and Publication

Dear Editor, I read the recent publication on “Misconduct in Research and Publication” with great interest[1]. I agree that misconduct in research and publication is not uncommon. Nevertheless, it is rarely mentioned. In fact, there are many incorrect conceptions among researchers on publication ethics. The milder examples are attempts to report only the “positive outcomes&rdq...

متن کامل

Examining University Students' Scholarly Publication in English Journals: A Case for Postgraduate Students' Written Literacy Practices

This  research  aimed  to  screen  'essay  writing'  difficulties  that  non-native  university students  at  postgraduate  levels  usually  experience  regarding  scholarly  publication  in mainstream, English journals. Two sets of variables including written literacy competencies in Persian and English languages were mapped over language uses (General vs. Academic). Initial screenings  from  ...

متن کامل

Selection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets

Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001